Picture for Zheng Zhu

Zheng Zhu

Tencent, WeChat Pay

UniDriveDreamer: A Single-Stage Multimodal World Model for Autonomous Driving

Add code
Feb 02, 2026
Viaarxiv icon

Coordinated Pandemic Control with Large Language Model Agents as Policymaking Assistants

Add code
Jan 14, 2026
Viaarxiv icon

Spatial Multi-Task Learning for Breast Cancer Molecular Subtype Prediction from Single-Phase DCE-MRI

Add code
Jan 11, 2026
Viaarxiv icon

TokenSeg: Efficient 3D Medical Image Segmentation via Hierarchical Visual Token Compression

Add code
Jan 08, 2026
Viaarxiv icon

VLA-R1: Enhancing Reasoning in Vision-Language-Action Models

Add code
Oct 02, 2025
Viaarxiv icon

Erased, But Not Forgotten: Erased Rectified Flow Transformers Still Remain Unsafe Under Concept Attack

Add code
Oct 01, 2025
Viaarxiv icon

MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training

Add code
Sep 26, 2025
Figure 1 for MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training
Figure 2 for MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training
Figure 3 for MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training
Figure 4 for MimicDreamer: Aligning Human and Robot Demonstrations for Scalable VLA Training
Viaarxiv icon

EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation

Add code
Sep 26, 2025
Figure 1 for EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Figure 2 for EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Figure 3 for EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Figure 4 for EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation
Viaarxiv icon

EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer

Add code
Sep 26, 2025
Figure 1 for EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer
Figure 2 for EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer
Figure 3 for EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer
Figure 4 for EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer
Viaarxiv icon

ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction

Add code
Aug 11, 2025
Figure 1 for ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction
Figure 2 for ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction
Figure 3 for ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction
Figure 4 for ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction
Viaarxiv icon